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ABSTRACT 

We forecast constraints on cosmological parameters with primary CMB anisotropy informa- 
tion and weak lensing reconstruction with a future post-Planck CMB experiment, the Cosmic 
Origins Explorer (COrE), using oscillation data on the neutrino mass splittings as prior infor- 
mation. Our MCMC simulations in fiat models with a non-evolving equation-of-state of dark 
energy w give typical 68% upper bounds on the total neutrino mass of 0.136eV and 0.098eV 
for the inverted and normal hierarchies respectively, assuming the total summed mass is close 
to the minimum allowed by the oscillation data for the respective hierarchies (O.lOeV and 
0.06 eV). Including geometric information from future baryon acoustic oscillation measure- 
ments with the complete BOSS, Type la supemovae distance moduli from WFIRST, and a 
realistic prior on the Hubble constant, these upper limits shrink to 0.118eV and 0.080eV 
for the inverted and normal hierarchies, respectively. Addition of these distance priors also 
yields percent-level constraints on w. We find tension between our MCMC results and the 
results of a Fisher matrix analysis, most likely due to a strong geometric degeneracy between 
the total neutrino mass, the Hubble constant, and w in the unlensed CMB power spectra. If 
the minimal-mass, normal hierarchy were realised in nature, the inverted hierarchy should 
be disfavoured by the full data combination at typically greater than the 2rj level. For the 
minimal-mass inverted hierarchy, we compute the Bayes' factor between the two hierarchies 
for various combinations of our forecast datasets, and find that the future cosmological probes 
considered here should be able to provide 'strong' evidence (odds ratio 12:1) for the inverted 
hierarchy. Finally, we consider potential biases of the other cosmological parameters from 
assuming the wrong hierarchy and find that all biases on the parameters are below their 1 a 
marginalised errors. 

Key words: cosmology: theory - cosmological parameters - neutrinos - gravitational lensing: 
weak - methods: statistical 



1 INTRODUCTION 

In the most recent extension to the Standard Model of parti- 
cle physics, it has been established that at least two of the 
three neutrino mass eigenstates possess non-zero mass. The Su- 
perKamiokande experiment provided the first evidence for this 
with the detection of flavour oscillations in atmo spheric neutri- 
nos dFukuda et all 1 19981 . 1200 ll : lAshie et alJ l2004n . and the phe- 
nomenon has since been ob served in solar neutrinos b y the Sud- 
bury Neutrino Observatory (Ahmad et al. 2001, 2002), in reactor 
anti-neutrinos at KamLAND (Eguchi et al. 2003), and in accelera- 
tor ne utrinos by K2K and MINOS TAhn et alj200d : lAdamson et al] 
120081) . 

The oscillation experiments not only reveal that neutrinos have 
mass, but that the three mass eigenstates have different masses. Os- 
cillations are only sensitive to differences in the squared masses 
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of the three eigenstates, and not to the overall mass scale. We use 
the centr al values from the g lobal fits in Appendix B of the arXiv 
update to Malto ni et al. (2004): 

m\-m\ = 7.9±£g x 10~ 5 eV 2 

(1) 

\m\-m\\ =2.2+^ x 10~ 3 eV 2 , 

with 3o" confidence levels indicated. Since the sign of — m\ is 
unconstrained, there are two logical possibilities depending on the 
choice of sign. The difference is negligible for large total masses 
(the 'degenerate' scenario, where m\ ~ mi ~ m^). At lower total 
masses, we have a hierarchical situation, with two distinct hierar- 
chies demarcated by the oscillation data: 'normal' (m\ < m2 <C WI3) 
and 'inverted' (1113 <C mi < mi). Measuring the absolute mass scale 
and determining the true hierarchy of neutrinos are key issues in 
neutrino physics. Note that equation ([T} implies a lower limit on the 
total mass for each hierarchy: 0.095eV for inverted, and 0.056eV 
for normal. 
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Cosmology provides a different perspective on this problem, 
essentially by being sensitive to the gravitational effect of neutri- 
nos on the matter and radiation fields across cosmic time. The pri- 
mary CMB temperature (T) and polarisation (P) anisotropies can 
go some way to constraining absolute neutrino masses, but are 
fairly insensitive to realistic (sub-eV) mass scales compared to the 
late -time influence of neutr i no masses on the c lustering of mat- 
ter felgarov & Lahavl 120051 : llchikawa et al.ll2005l) . Tighter limits 
may therefore be obtained by including large scale structure in- 
formation, suc h as the shape of the matter power spectrum and 
cosmic shear dNamikawa et al.l 12010). as well as distance mea- 
sures such as b aryon acoustic oscilla tions (BAO) and Type la 
supernovae (see lAbazaiian et all 1201 ll for a recent review). With 
the seven-year WMAP data alone, £m v < 1.3eV (95% C.L.) as- 
suming a flat universe and fi xed dark energy equation of state 
w = — 1 dKomatsu et alj[20lll) ; this imp roves to £m v < 0.5 8eV 
when including BAO from SP SS DR7 dPercival et alj|2010h and 
a Hq prior of 4% width from iRiess et all d2009l) 7For the same 
model, the most recent constraints from addinggalaxy cluster- 
ing to WMAP are £m v < 0.36eV dde Putter etaUl2012l) . using 
a photometric sa mple of luminous g alaxies from SDSS DR8, and 
£m v < 0.51 eV dSanchez et alj2012l) . using a spectroscopic galaxy 
sample from SDSS DR9 (as part of the ongoing Baryon Oscillation 
Spectroscopic Survey; BOSS). 

Small-scale CMB measurements open up the possibility of 
exploiting the effect of weak gravitational le nsing (WL) of the 
CMB as well as the primary anisotropies (see llxwis & ChallinoJ 
2006 for a review of CMB lensing). CMB photons are deflected 
by large scale structure along the line of sight as they propa- 
gate to us from last-scattering, and these deflections can be recon- 
structed fro m their non-Gaussian imprint in the CMB at small an- 
gular scales dZaldarriaga & Selia3l999t iHufcoOll; iHu & Okamotol 



|2002| ; lOkamoto & Hull2003h . CMB lens reconstruction thus pro- 
vides both geometric and late-time clustering information 'for free' 
and so additional sensitivity to neutrino masses. Since the source 
plane is essentially fixed - the last scattering surface - this method 
is free of uncertainties over source redshifts, as well as having 
the advantage of probing structure formation without the problems 
of bias and redshift-space distortions. Lens reconstruction has re- 
cently been used to measure the power spectrum of the deflection 
field with te mperature data from the Atacama Cosmology Tele- 
scope (ACT;lDas etalJ|201ll) and the South Pole telescope (SPT; 
Ivan Engelen et alj|2012h ! 

Lens reconstruction from the CMB temperature suffers from 
statistical noise due to chance correlations in the unlensed CMB 
that mimic the effect of lensing. This is such that temperature re- 
constructions will never supply cosmic-variance-limited measure- 
ments of the deflection power spectrum for mul tipoles / > 100. 
Polari zation measurements are very helpful here (Hu & O kamotol 
2002), since they intrinsically have more small-scale power and the 
B-mode of polarization is not confused by primary anisotropies. 
In principle, polarization can provide cosmic-variance limited re- 
constructions to multipoles / ~ 500, i.e. on all scales where lin- 
ear theory applies. For this reason, lens reconstruction from polar- 
ization has become an important part of the science case for suc- 
cessors to the Planck satellite, such as the proposed European-led 
Cosm ic Origins Explorer (COrE; The COrE Collaboration et al. 
|2011|) . and the US-led CMBPol (e. g. iBock et alj I2008L 12009^ 
and polarization upg rades to ACT dNiemack et al.l |20ld)~ and 



masses. This raises the exciting possibility of using cosmolog- 
ical observations to constrain not only the absolute mass scale 
but also the hierarchy (in the case of non-degenerate masses). 
A number of important questions then arise. Will future or cur- 
rent CMB T+P+WL experiments be able to determine the hier- 
archy by themselves, or will the inclusion of external data-sets, 
with their inevitable systematic uncertainties be necessary? What 
limits the ability of cosmology to determine the hierarchy, and 
how can these limits be circumvented? Several stud ies have al- 
ready attempted to answer some of these questions ( Ovama et alj 
| 2012l; Ijimenez et alj|2010l ; iDe Bernardis et al.ll2009l ; lsiosanl2006t 
Hannestad 2003). Four of these papers chose to parametrize the 
hierarchy continuously, either by the fraction of the total mass in 
mj, or using A = (m\ —mo,)/'Lm v , with m\ =mi- However, as the 
constraints in equation dQ indicate, the uncertainty on the mass- 
squared differences is small, and provides good evidence that the 
hierarchy is either normal or inverted. In other words, the choice of 
hierarchy is not a continuous parameter, but a discrete one: with an 
oscillation prior, the Zm v -A space breaks up into two disjoint, one- 
dimensional spaces and the simplest way to proceed is to analyse 
any cosmological dataset in both normal and inverted models with 
the masses parametrized by £m v . 

The problem of determining the mass hierarchy is then one of 
Bayesian model selection, rather than parameter forecasting. The 
machinery in this case is provided by the Bayes' factor, which 
quantifies the degree to which different models are favoured by the 
data with respect to one another, when all their associated parame- 
ters are marginalised over. In this work, we investigate the ability of 
future CMB experiments to determine simultaneously the absolute 
neutrino mass scale and the (non-parametric) hierarchy. We first 
calculate forecasts for CMB-only T+P+WL with COrE using both 
Markov-chain Monte Carlo (MCMC) and Fisher techniques, and 
then include geometric infor mation from future BAO constraints 
from the full SPSS-Ill BOSS dSchlegel et al]|2009h. and a Type la 
supernovae survey with WFIRST dGreen et al.ll201lh . We also in- 
clude a future prior on the Hubble constant. We do not consider 
information from clustering in this work, such as the matter power 
spectrum from redshift surveys, number counts, or cosmic shear. 
Our results are thus immune to the inherent systematic uncertain- 
ties in these techniques. 

The paper is organised as follows. In Section|2]we discuss the 
cosmological influence of both the total neutrino mass and the in- 
dividual neutrino masses, and the relevant degeneracies with the 
other cosmological parameters. In Section [3] we discuss our sta- 
tistical forecasting and model selection methodology. In Section|4] 
we introduce the future datasets which will be available as priors 
for future CMB experiments, and we present and discuss our re- 
sults in Section[5] Appendices detail our scheme for protecting de- 
generacies in the construction of the Fisher matrix and discuss the 
sampling errors in estimates of parameter covariance matrices from 
MCMC samples. 



2 COSMOLOGICAL SIGNATURES OF NEUTRINO 
MASSES 



2.1 Massive neutrinos 



and polarization upgrades 
SPT dMcMahon etail 2009) 



Structure formation is mostly sensitive to the summed neu- 
trino mass, but does have some weak sensitivity to individual 



Massive neutrinos have a small, but measurable effect on both the 
primary ani sotropies of the CMB and the growth of structure (for 
reviews see lElgarov & Lah av 2005; Lesgourgu es & Pastorl 120061: 
iHannestadkOld) . We first consider the primary anisotropies . Since 
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Figure 1. Upper: Unlensed CMB temperature power spectra for a model 
with massless neutrinos (dashed red) and degenerate massive neutrinos 
with Y. m v = 0.37eV (solid black). Both models are flat, have the same 
physical densities in cold dark matter and baryons, but different Hub- 
ble constants (Ho = 67.93kms~' Mpc~' for the massive case and Hq = 
71.43kms~' Mpc~' for the massless case) to preserve the angular scale of 
the acoustic peaks. Lower: Fractional difference between the massive and 
massless model. Note that the .v-axis is logarithmic for / < 50 and linear for 
1^50. 



the r.m.s. momentum of a neutrino of mass m v at temperature T v (z 
satisfies 

(pD 1/2 



i 3.22 



k B T v (z) 5.5x10- 



(1- 



(2) 



m v m v (m v /eV) 

neutrinos with mass less than 0.5 eV are still relativistic at recom- 
bination. Their effect on the pre-recombination dynamics of the 
background and perturbations is thus very similar to the massless 
case and so they impact the anisotropies only indirectly through 
the angular diameter distance to last-scattering The ratio 

of cIa(z*) to the sound horizon at last-scattering r s (z*) sets the 
angular scale of the CMB acoustic peaks. For light masses, the 
change in sound horizon is small but, if all other physical densi- 
ties are held fixed, d^(z t ) falls with increasing mass due to the in- 
creased expansion rate at late times. The last-scattering therefore 
appears closer and the anisotropies are shifted to larger angular 
scales. In the flat models considered here, this effect is degenerate 
with a change in either the dark energy density (or, equivalently, the 
Hubbl e constant) or the d ark energy equation of state parameter w 
(Efstathiou & Bo ndll99^) . An example of this degeneracy is shown 
in Fig.[T]for flat models with w = — 1 and either massless neutrinos 
or degenerate massive neutrinos with Y, m v = 0.37eV. These mod- 
els cannot be distinguished on the basis of their (unlensed) spectra. 
Howeve r, since the H ubble constants differ by 4%, current priors on 
Hq (e.g. lRiess et a DUE 1) would effectively break the degeneracy 

between these models. 

The degeneracy is explored in detail in iHowlett et ail J2012I) . 
It is not exact due to a number of physical effects, most notably 
the late-time integrated-Sachs-Wolfe (ISW) effect; see Fig.[T] This 
arises from the late-time decay of the (Weyl) gravitational poten- 
tial + y once dark energy dominates the dynamics of the expan- 
sion. On large scales (and for adiabatic initial conditions) the evo- 
lution of the gravitational potential <j> follows from constancy of the 
comoving-gauge curvature perturbation 



where p to t and p to t are the total density and pressure (including 
contributions from dark energy), Jf? is the conformal Hubble pa- 
rameter and dots denote derivatives with respect to conformal time. 
The metric potential \ff = <j) at late times when anisotropic stresses 
can be neglected. The potential is constant for constant ptot/Ptob 
but evolution in the latter after last-scattering sources the ISW. In 
Fig. [2] we plot ptot/Ptot as a function of the scale factor a for the 
two models considered in Fig. [I] The dominant effect in both cases 
is from the radiation-matter transition and the onset of dark en- 
ergy domination. The former causes <j> + yf to decay around last- 
scattering, sourcing the early-ISW effect which makes a signifi- 
cant contribution to the temperature power spectrum around the 
first acoustic peak. As dark energy dominates, ptot/Ptot — ► — 1 from 
nearly zero during the matter-dominated era causing further de- 
cay of the potentials and sourcing the late-time ISW effect. Mas- 
sive neutrinos change the picture in the following ways. Com- 
pared to massless neutrinos, p v increases as they become non- 
rel ativistic and p v decreases; the onset of this transition is described 
by fcewis & Challinoj|2002l) 



Pv 



Pv 




(4) 



where pj) is the energy d ensity per species of mass less neutrino. In 
the non-relativistic limit (Lewis & Challinorl l2002l) . 
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The ratio ptot/Ptot initially falls more quickly in models with mas- 
sive neutrinos and this leads to enhanced decay of the Weyl poten- 
tial around last-scattering and a larger early ISW effect (see Fig.[TJ. 
If the angular scale of the acoustic peaks and the physical densities 
in cold dark matter and baryons are fixed (the latter preserving the 
pre-recombination physics), the fraction of energy density in dark 
energy is less in models with massive neutrinos and at late times 
Ptot/Ptot falls less slowly towards — 1. This reduces the late ISW 
effect. 

The direct effect of neutrino masses on the pre-recombination 
physics is very small for m v <C 0.5 eV. The enhanced energy den- 
sity reduces the sound horizon and damping scale, with the for- 
mer being compensated by changes in (1a{z*) if we fix the angu- 
lar scale of the acoustic peaks. The neutrino perturbations them- 
selves influence the CMB via the back-reaction from their stress- 
energy on the metric perturbations. The corrections are 0(1 — v v ) 
where v v = 1 — 0(m v /ksT v ) 2 is the typical neutrino thermal ve- 
locity. The size of the effects in the neutrino stress-energy tensor is 
0(m v /kBT v ) 2 py and the relative importance for the metric pertur- 
bations and the CMB is 0(m v /kgT v ) 2 pj) / p tot . For our model with 
Y. m v = 0.37eV, we expect effects at the 0. 1% level consistent with 
the small residual differences at high / in the CMB power spectrum 
shown in Fig.Q] 

We end our discussion of the primary anisotropies by noting 
that the effects of l-10eV masses on the CMB are rather different, 
since the neutrinos are alread y non-relativistic at the time of recom- 
bination (Dodels on et aljl99q) . At these masses, which are already 
ruled out by current upper limits on the total mass, the neutrinos act 
as a hot dark matter component. 
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Figure 2. Upper: Evolution of Ptot/Ptot with scale factor a for the massive 
(solid) and massless (dashed) models in Fig.[T] Lower: Difference between 
Ptot/Ptoi in these models. Also plotted are the relativistic approximation 
from equation l|4j and the non-relativistic approximation from equation (3). 
The differences in Ptot/Ptot at late times are due to the reduced energy den- 
sity in dark energy in the massive model to preserve the angular scale of the 
CMB acoustic peaks. 



The matter po wer spectrum is al so affected by the presence of 
massive neutrinos jBondetalj|l98(]|) . Once non-relativistic, they 
increase the expansion rate over the massless case, but the ten- 
dency of this to impede growth in the clustering of the other matter 
components is mitigated on scales where the neutrinos can clus- 
ter. At any time, neutrinos can cluster on scales larger than their 
proper Jeans (or free-streaming) length, approximately v v (z) / H(z). 
For massless neutrinos, this is simply the particle horizon and the 
comoving Jeans length grows in time in a non-accelerating uni- 
verse. However, for non-relativistic neutrinos, the free- streaming 
length (Lesgourgues & Pastor 20061): 

1 eV\ 



Afs(*)«7.7- 



1+z 



■ } fi- 
tly I 



Mpc, 



(6) 



V^A + ^m(l+z) 3 

where £l m is the current density parameter for matter including 
non-relativistic massive neutrinos. For a non-relativistic transi- 
tion in matter domination, A FS (z) ~a [ / 2 , so the comoving free- 
streaming length decreases in time. Therefore, the comoving free- 
streaming wavenumber [fcpg = 2^a/Aps(z)] has a minimum given 
by the comoving scale of the horizon at the non-relativistic transi- 
tion. On scales larger than this, neutrinos have always clustered and 
their mass has no effect on the matter power spectrum. Below the 
comoving horizon at the non-relativistic transition, neutrinos only 
cluster only after they exit the (shrinking) comoving free- streaming 
length thus slowing down the growth of structure in the intervening 
time. On all scales smaller than the current free-streaming scale, 
massive neutrinos are not clustered at the present time giving a 
scale-free fractional suppression of the matt er power powe r spec- 
trum by roughly -8/ v , where f v = Q. v /Q. m dHu et alJI 19981) . 

For sub-eV neutrino masses, kps today is at roughly the same 
scale that non-linear correc tions to the matte r power spectrum be- 
gin to become important dHannestadl l2010h . Af-body simulations 
indicate a larger suppression of ar ound — 9.6/ v at scales k ~ 0.5- 
l/iMpc -1 , accu rate to about 1% dBra ndbyge & Hannestadl 120091 : 
IViel etalJ boiO). Note that this is considerably smaller than the 
BAO scale (k 0.15/iMpc- 1 )- 

One of the key observables considered in this paper is the re- 
construction of the CMB weak-lensing deflection field from the 



lense d primary anisotropies (for a review, see iLewis & Challinoil 
2006). In linear theory, the power spectrum of the lensing deflec- 
tion angle, Cf d , is a line-of-sight integral over the matter power 
spectrum, and so the effect of massive neutrinos is similar to their 
effect on the growth of large-scale structure. The suppression of 
small-scale power is still the primary effect, as shown in the upper 
plot of Fig. [3] where we plot the fractional change in the lensing 
power spectrum in the presence of massive neutrinos compared to 
the massless case. The total mass is 0.095 eV, the minimum mass 
of the inverted hierarchy. The differences are at the few percent 
level, significantly larger than the intrinsic effect in the unlensed 
CMB anisotropies which is at the 0.01% level (for / ^ 100) for 
such masses. Moreover, the information is complementary since 
the effect in the lensing power spectrum scales roughly with the 
total mass while the intrinsic effect in the unlensed CMB scales 
with the square of the masses. Neutrinos also have a small effect on 
the cross-correlation of the lensing deflection with the CMB tem- 
perature anisotropy (d ue to the late-time ISW effect) and the CMB 
£-mode polarisation dlxwis et alj|201 ll) . and we include both in 
our analysis for completeness. 



2.2 Individual masses 

As we have seen, sub-eV massive neutrinos mostly affect the 
primary CMB anisotropies through their effect on the angular- 
diameter distance d-A (z*). The masses are therefore degenerate with 
other late-time parameters (such as the dark energy model). Even if 
the other late-time parameters are fixed by external distance data, 
there is very little sensitivity to individual masses since dU(z*) is 
determined primarily by the summed mass Y, m v 

Ri93.14fl v /i 2 eV. 

The signature of mass differences amongst neutrinos is thus 
felt mainly through their effect on the growth of structure. Differ- 
ent masses have different free-streaming wavenumbers, and each 
has their own unique signature on the structure formation history 
of the universe. As extremes, if all the mass were in one eigenstate 
the non-relativistic transition would be earlier than if the masses 
were degenerate and the damping in the matter power spectrum 
would not extend to such large scales. On scales smaller than, 
but close to, the horizon size at the non-relativistic transition for 
the degenerate case, the matter power spectrum should therefore 
be smaller for the degenerate case compared to if all mass were 
in one eigenstate. However, this behaviour reverses on scales be- 
low the smaller (i.e. the non-degenerate case) of the free-streaming 
scales at the observed redshift, and the degenerate case would have 
less suppression of power. This is because on such scales the neu- 
trinos would never have clustered since early times and the de- 
generate case would have a later non-relativistic transition and a 
slightly lower neutrino energy de nsity through the extend ed tran- 
sition epoch (see Figs 3 and 4 in [Lesgourgues e t alj|2004h . These 
signatures also show up in the CMB WL power spectrum, which 
roughly reflects the matter power spectrum around z ~ 2. This is 
illustrated in the lower plot of Fig. [3] where we plot the fractional 
difference in the deflection power spectrum compared to the case 
of degenerate masses in the two hierarchies at fixed total mass 
£w v = 0.095 eV. The situation is rather more subtle than the ex- 
treme cases discussed above, since there are effectively two free- 
streaming scales and non-relativistic transitions in the normal hier- 
archy at this total mass. Although cosmic variance (3% at I = 1000) 
dominates the differences in deflection power between the two hier- 
archies at each multipole, the broad-band nature of the signal means 
that we can combine many multipoles to beat down cosmic variance 
(to roughly 0.1% for all multipoles up to / = 1000). 
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Table 1. Fiducial parameters used throughout this work. Note that the den- 
sity parameters for massive neutrinos are close to the values in the minimal- 
mass normal and inverted hierarchies: they correspond to total masses 
Y. m v = 0.062eV (normal) and Y, m v = 0.105eV (inverted). 



Parameter 


Value 


Q b fr 


0.023 


a c h 2 


0.112 


h 


0.703 


T 


0.085 


4s 


2.42xl0~ 9 


n s 


0.966 


Q. v h- (normal) 


0.000666 


Q. v h 2 (inverted) 


0.00113 


w 


-1 


100e(z„) (normal) 


1.03957 


1009( Zt ) (inverted) 


1.04081 



0.2r 




-0.5 L 



10 



1 



10 



10 



Figure 3. Upper. Fractional difference of the lensing power spectrum from 
a scenario with massless neutrinos, for a total mass of 0.095 eV. The sup- 
pression of power on small scales is clearly seen. Lower. Fractional differ- 
ence of the lensing power spectrum from a scenario with degenerate neutri- 
nos, for a fixed total mass of 0.095 eV in the inverted hierarchy (blue) and 
normal hierarchy (green). 



It should therefore be possible, at least in principle, to deter- 
mine the mass hierarchy from a combination of CMB T+P and 
CMB WL observations. Including the oscillation measurements as 
prior information significantly ameliorates this task, as we only 
have to determine a single model from a choice of two, rather than 
deduce the mass splittings purely from cosmology. However, as we 
shall see, parameter degeneracies present a considerable obstacle 
to realising this goal. 

Finall y, we note that it has recently been 
claimed (Wa gner etaT]|2012h that non-linearities roughly double 
the matter power spectrum differences between the hierarchies for a 
fixed total mass appropriate to the mininal-mass inverted hierarchy 
(L m v = 0.095eV) on mildly non-linear scales (k ~ 1/jMpc -1 ). 
However, such small scales are deep in the region where CMB 
lensing reconstructions will always be dominated by statistical 
noise; see Section[33] 



Table 2. Central frequencies V, beam full-widths at half-maximum 9f w hm, 
and temperature and polarization sensitiv ities, Aj and Ap, for the CMB 
channels of COrE jThe COrE Collaboration et al.l201 lb . 



V 


^fwhm 


Ay- 


Ap 


(GHz) 


(arcmin) 


(pK arcmin) 


(^K arcmin) 


105 


10.0 


2.7 


4.6 


135 


7.8 


2.6 


4.5 


165 


6.4 


2.6 


4.6 


195 


5.4 


2.6 


4.5 


225 


4.7 


2.6 


4.5 



3 STATISTICAL METHODS 
3.1 Markov-chain Monte Carlo 

The principal tool we use to forecast parameter constraints is 
Markov-chain Monte Carlo (MCMC). We first generate fiducial 
unlensed CMB spectra, Cf T ', Cj E and Cf E , the lensing deflec- 
and the cross-correlations, Cf d and Cf d , 



tion power spectrum, Cf d , 

using the publicly-available Boltzmann code CAMB Lewis etal] 
2000) with fiducial parameters 9q. We then analyse the mean log- 
likelihood as a function of parameters 8, given by 

\C(8) 



-2(hiP(8\8o)) =Tr[C(0 o )C- 1 (e)]+ln- 



C(6b)| 



-dim(C), (7) 



where C 



the covariance matrix of the data vector d = 
, which consists of the unlensed temperature and E- 
mode polarization, and the reconstructed lensing deflection mul- 
tipoles. Note that C includes instrument noise and the statistical 
noise of the lensing reconstruction. The mean log-likelihood has 
been normalised to zero at the maximum-likelihood point, where 
8 = 8 p. Such a likelihood was considered in e.g. iLewis & Kind 
(2006) in the context of extracting cluster masses from CMB lens- 
ing. 

Our fiducial parameters are given in Table [T] and are 
the ma ximum likelihood es timates from the WMAP 7-year 
release ( Komat su et alj 1201 ll) except for the density parame- 
ters for massive neutrinos which we take to be close to the 
values for the minimal-mass normal and inverted hierarchies. 
We use noise levels appropriate to th e proposed COrE mis- 
sion {The COrE Collaboration et al .1201 ll) ; see Tablel2l We account 
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Figure 4. Power spectrum of the statistical noise on lensing reconstruc- 
tions for COrE using only temperature information (red) or temperature and 
polarization (blue). The linear-theory lensing deflection power spectrum is 
also shown (black), along with the effect of including the non-linear matter 
power spectrum (black dashed). 



crudely for removal of astrophysical foregrounds by using only the 
frequency channels in the range 105-225 GHz and retaining a frac- 
tion /l^y = 0.7 of the sky. We implement the latter approximately 
by reducing the number of degrees of freedom per observable at 
each multipole from 21 + 1 to (21 + l)/ s ky when evaluating equa- 
tion Q. We retain multipoles up to / max = 2500. To compute the 
statistical noise on the lensing de flection reconstruction , we use the 
optimal quadratic estimator of Okamoto & Hu ( 2003); the noise 
power spectrum is shown in Fig.|4]for reconstructions from COrE 
using only temperature or temperature and E and 6-mode polariza- 
tion. For COrE, the E-B estimator dominates the reconstruction. 

Note that we use the unlensed temperature and is-mode polar- 
ization in the likelihood. These are not directly observable but we 
use them as a simple work-around of the double-counting issues 
that may arise when jointly analysing the lensed CMB and lens re- 
construction. 

We impose the prior w ^ — 1 on the dark energy equation of 
state. This represents prior preference for a model in which dark 
energy is described by a field satisfying the weak energy condi- 
tion . Models with w < — 1 do exist, but possess greater complex- 
ity dFang et alj|20 08). and are potentially unstable at the quantum 
level iCarrol l et al. 2003). Since the data do not prefer w either side 
of — 1 , we select the simpler model, w ^ — 1. 

We sample from the likelihood using the publicly-available 
COSMOMC package dLewis & Bridlejl2002l) . modified to include 
the effects of neutrino mass splitting. We use a modi fied version 
of the FUTURCMB lensing add-on for COSMOMC dPerotto et all 
2006) including the small correlat ion between the £ -mode polar- 
ization and the lensing deflection dLewis et alj|201ll) calculated in 
CAMB as Cf d . COSMOMC works natively with the angular scale 
of the sound horizon at recombination, as opposed to h, so 

we include the fiducial value of this parameter in Table [T]for com- 
pleteness. 

When forecasting standard deviations of parameters, we must 
estimate the covariance matrix from the MCMC chains. For those 
parameters whose marginalised posteriors are approximately Gaus- 
sian, this is the quantity that should be compared to the results of 
a Fisher analysis (parameters with hard priors do not fall into this 
category and so do not require standard deviation estimates). How- 



ever, these estimates are inherently statistical in nature. If we wish 
to compare the results of the MCMC analysis to those of a Fisher 
analysis we need to know the typical statistical fluctuation of our 
covariance matrix estimates. This translates into some scatter in 
the eigenvalues and eigenvectors, a matter that we explore in Ap- 
pendix [B] 



3.2 Fisher analysis 

In Section|5] we make some comparisons between parameter con- 
straints obtained with MCMC and those from a Fisher analysis. 
The Fisher information matrix is the Hessian of (minus) the mean 
log-likelihood at the fiducial parameters. To the extent that the 
likelihood is Gaussian in the parameters, constraints derived from 
MCMC exploration of the mean log-likelihood should agree with 
the Fisher analysis. Differentiating equation (|7) with respect to 
the pa rameters gives the Fisher matrix (see, for example iDodelsor] 
120031) 



-Tr 

2 

E E 

l=2XX',YY' 



d6i 



dC 



^-[Cav(XX',Y7 f ) 



(8) 



where Cav(XX' ,YY') is the covariance of the power spectra esti- 
mators, including noise, and XX' and YY' stand for the spectra TT, 
EE, TE, dd, Td, and Ed, with dd the weak lensing power spec- 
trum etc. Assuming Gaussian fields and noise, the specific form of 
Coy {XX 1 , YY') is 



[Cov(XX',YY% = 



(2/ + l)/ sky 



^XY ^X'Y ~XY' ~X ! Y 



(9) 



where / s k y is the fractional sky coverage. The tildes denote the total 
power spectra including instrument noise for XX' = TT and EE 
and reconstruction noise for XX' = dd. The inverse of the Fisher 
matrix gives the covariance matrix between the parameters and its 
diagonal elements give the lo~ marginalised errors on parameters. 

When constructing the Fisher matrix, it is important to use ac- 
curate power spectrum derivatives, since numerical noise in these 
can artificially break degeneracies leading to over-optimistic pa- 
rameter constraints. We found this particularly troublesome for the 
lensing deflection field derivatives. Our brute-force solution was 
to run CAMB at a high accuracy setting (accuracy_boost=5) to 
remove this noise, as it was found not to be due to a bad choice 
of derivative step siz^H In addition, we found it necessary to en- 
force some parameter degeneracies directly in the construction of 
the Fisher matrix. We detail these issues in Appendix [A] 



3.3 Bayesian model selection 

Distinguishing between the two hierarchies is a problem of model 
selection. We can quantify our relative degree of belief in different 
models by use of the Bayes' factor, defined as the ratio of Bayesian 
evidences. Consider two models M and M' (for example, normal 
and inverted hierarchies), with parameter vectors 9 and 9', not nec- 
essarily of the same dimension. Let x be the data vector. The ratio 
of posterior probabilities is 



1 We used a pre-January 2012 version of CAMB for this work. Several im- 
provements in numerical accuracy were made for the January 2012 version; 



see lHowlett et alj 120121) . 
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p{M'\x) _ p(x\M')p(M') 

p(M\x) p(x\M)p(M) ' K } 

where p(M) is our prior degree of belief in the model M and sim- 
ilarly for M'. The evidence p(x\M) is found by marginalising the 
product of the likelihood p(x\ 0,M) and the parameter prior p(6\M) 
over parameters. Assuming no a priori preference for either model, 
p(M) = p(M'), and so the ratio of posterior probabilities becomes 
the Bayes' factor 

jde'p(x\e',M')p(e'\M') 

jde P (x\e,M)p(e\M) ■ ( ' 

Evaluating B involves a difficult multi-parameter integration but 
we may make some analytical progress by approximating the like- 
lihood as Gau ssian, the 'Laplace approximation' (for example, 
lMacKay| |2003). For the case of £m v and w, the prior cuts off the 
posterior at close to the maximum likelihood point, so the integra- 
tion over these parameters must be performed numerically. Starting 
from our MCMC chain samples, we perform a least-squares fit of 
a multi-variate Gaussian to the likelihood. For model selection, we 
parametrize the neutrino masses in terms of the lightest mass, rather 
than the total mass, so that the prior volumes for the two hierarchies 
are equal. We calculate the evidence assuming a uniform prior in 
the lightest mass (and the other parameters) by first integrating ana- 
lytically the Gaussian fit over all parameters except w and the mass. 
The resulting two-dimensional Gaussian is integrated over the prior 
range numerically. The Bayes' factor then reduces to a product of 
the maximum likelihood ratio and an 'Occam factor' in analogy 
to the fully Gaussian case. However, here the Occam factor derives 
from both the Gaussian covariance and the degree to which the like- 
lihood is cut off by the prior. 

We perform model comparison in the case of the minimal- 
mass inverted hierarchy taking the 'data' to be the fiducial power 
spectra (plus noise) calculated in the inverted hierarchy. When anal- 
ysed assuming the normal hierarchy, the likelihood has a local max- 
imum within the prior volume. We locate this point, and the likeli- 
hood there, leaving us to fit only the covariance of the multi-variate 
Gaussian. The Bayes' factor we calculate is the ratio of the evi- 
dences derived from the respective mean log-likelihoods. Although 
this will generally differ from the mean Bayes' factor, we denote 
it by {B) noting that its value should be typical in an ensemble of 
data from the minimal-mass inverted hierarchy. 

The smaller the volume of the likelihood confidence ellipsoid, 
the more finely-tuned the model must be to fit the data. Such mod- 
els are penalised in the Bayes ' factor in favo ur of models which 
do not need such fine-tuning (MacKay 200^). The other term in 
the Bayes' factor, involving the ratio of the maximum likelihoods, 
represents the data's sensitivity to the mass splitting. If cosmol- 
ogy were insensitive to mass splittings, the ratio of maximum like- 
lihoods would always be unity if the true total mass were large 
enough to be realised in either hierarchy. In this case, the Occam 
factor will generally favour the normal hierarchy as the likelihood 
is non-zero over a greater range of the prior volume. 

Ijeffrevsl dl96ll) proposed model selection criteria depending 
on the value taken by the Bayes' factor. If InS > 5, evidence for 
model M' is 'decisive' over model M, if 2.5 < InS < 5 the evi- 
dence is 'strong', and if 1 < InS < 2.5 it is 'substantial'. For a dis- 
cussion of the s uitability of these crite ria, and the usefulness of B 
as a statistic, see lEfstathioul d2008l) and I Jenkins & Peacockl d201ll) . 



4 EXTERNAL DATASETS 

In this section we discuss the various priors from non-CMB 
data that we include in our analysis. For a comprehensive sur- 
vey ofjhejitility_of extern al data in constraining neutrino masses, 
see lAbazaiian et all J201 lb - We consider only a subset of all pos- 
sible probes, since the main focus of this paper is information 
from primary CMB anisotropies with a weak lensing reconstruc- 
tion. CMB experiments offer a relatively clean source of cosmolog- 
ical information at multipoles / < 2000 since the relevant physics 
is simple and well-understood and extra-Galactic foregrounds are 
sub-dominant to the primary CMB fluctuations. 

We only include external geometric probes (Hq, luminosity 
distances from supemovae and BAO) here. Other direct probes of 
the clustering of matter on small scales, such as galaxy cluster- 
ing, galaxy weak lensing, the Ly-a forest and cluster abundances 
are potentially very useful probes of neutrino masses because of 
the scale-dependent growth associated with neutrino free- streaming 
(see Section [2J. However, associated problems such as redshift- 
space distortions, scale-dependent galaxy bias, source redshift un- 
certainties, and the fact that the free-streaming scale lies close to the 
non-linearity scale at low redshift make it interesting to see what 
can be achieved with only the CMB and relatively clean geometric 
probes. 



4.1 Hubble constant 

As we discuss later in Section|5] our Fisher matrix indicates that the 
geometric degeneracy between h, w and neutrino mass is not com- 
pletely broken by lensing. The neutrino mass contributes to this 
degeneracy in a small way, so we might hope to improve our fore- 
casts with a simple prior on Hq. Current precision from the Hubble 
Space Telescope is around the 3% level dRiess et alJuOllh . but for 
the purpose of forecasting we impose a 2% prior, not unrealistic o n 
the time-scale of a mission like COrE (Freedman & Madorel20ld) . 



4.2 WFIRST Type la supernovae 

Distance modulus information from Type la supernovae offers ge- 
ometric information about the universe which can be a useful probe 
of late-time phenome na such as dark energy dRiess et al.lll998l : 
IPerlmutter etai] 1999b . as well as a tool for breaking geometric de- 
gener acies inherent in the CMB ([Efstathiou et al. 1 999 ; LineweaveJ 
Il998h . 

In this work, we forecast distance modulus measurements 
from the Wide-F ield InfraRed Survey Telescope (WFIRST; 
iGreen et"al] l201ll) . the highest ranked recommendation for 
large space-based missions in the 2010 US Decadal Sur- 
vey dDecadal Survey of Astronomy and Astroph ysics 1 12010|) . Ex- 
pected survey characteristics were taken from the WFIRST In- 
terim Reporjj. We assume their 'conservative' figure of merit as- 
sumption, but double the survey time to 12 months. We forecast 
200 supernovae in each of eight redshift bins between z = 0.4 
and z = 1.2, each bin having Az = 0.1. We augment this sam- 
ple with 500 nearby (z < 0.1) supernovae, as forecast by the Fig- 
ure of Merit Science Working Group PaneQ We assume a scatter 



http : //wf ir st . gsf c 


nasa.gov/science/WFIRST_Interim_Report.pdf 


http : //wf ir st . gsf c 


nasa.gov/science/fomswg/fomswg_technical.pdf 
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Table 3. Forecast BOSS errors on and H(z) in three redshifts bins. 



z 


d A (z) 


H(z) 


0.35 


1.0% 


1.8% 


0.60 


1.1% 


1.7% 


2.50 


1.5% 


1.5% 



°M (z) = 0. 1 1 + 0.033z of the apparent magnitudes for each super- 
nova, after light-curve fitting, about the (unknown) absolute mag- 
nitude M. The mean absolute magnitude in each redshift bin then 
has variance 0^ t (z;) = 0^(z;)/iV; + C7^ ys (z;), where AT; is the num- 
ber of supemovae in each bin and o"f ys (z) = 0.02(1 +z)/1.8 repre- 
sents a floor in the scatter due to systematic effects. Our treatment 
is consistent with the WFIRST 'conservative' forecasts. We neglect 
potential biases through use of different light curve fitters, and as- 
sume all such uncertainty is contained in our systematic error. 

We construct the mean log-likelihood after m arginalising over 
the a bsolute magnitude M (see Appendix F of iLewis & Bridle! 
l2002h . Assuming no correlation between redshift bins, we have, 
up to a constant, 



-2<lnP(e|0 o )}=£ 



„ A/x f 



,(12) 



where Ali; = — li(z,;f3o) with li(z,;0) the distance mod- 

ulus at parameters 6, and 6q is the fiducial model. The sums are 
over redshift bins. The second term in equation 11121 arises from the 
marginalisation and ensures that the log-likelihood does not change 
under A/i,- -» Ajlt; + M. 



4.3 BOSS baryon acoustic oscillations 

The sound horizon at the baryon drag epoch, when baryons 
were effectively released from photons, imprints a characteris- 
tic scale in the matter distribution. Observing projections of this 
standard ruler in the galaxy distributi on allows one to map out 
H (z) and aU (z) at a range of redshifts (Blake & Glazebrook1l2003t 
ISeo & Eisensteinl 120031 ; iHu & Haimanl l2003h . Current measure- 
ments of BAO are limited to the spherically-averaged correlation 
function (or power spectrum) which is sen sitive to an effectiv e 



distance \dj(z)/H(z)}^ 3 ; see, for examp le, iBeutler et all feOllh : 
Blake et al. Ander son e for the most recent mea- 

surements. 

A major adva nce in BAO detectio n will come from the com- 
pletion of BOSS l lSchlegel et ail l2009h , part of SDSS-III. This 
should allow separate measurements of the angular diameter dis- 
tance and Hubble parameter in several redshift bins. In Table [3] 
we show the forecast constraints on d^(z) and H(z), taken from 
the SDSS-III Project DescriptiorQ. What is actually measured is 
dA{z)/r s and H(z)r s , where r„ is the sound horizon at the baryon 
drag epoch, and it is this quantity we compute in our MCMC anal- 
ysis. The two are mildly correlated, and we assum e a correlation co- 
efficie nt of 0.4, consistent with the value found in lSeo & Eisensteii] 
d2007h . 



http : / /www. sdss3 . org/collaboration/description.pdf 



Table 4. Parameter errors (lo~) in the two hierarchies, comparing MCMC 
and Fisher matrix results. We assume noise levels appropriate to COrE and 
use only CMB T+P+WL. We quote 68% upper limits on Y. m v and w - 



Inverted 



Normal 





MCMC 




Fisher 




MCMC 




Fisher 




a b h 2 


3.71 x 10" 


5 


3.84 x 10" 


5 


3.62 x 10" 


5 


3.82 x 10" 


5 


dell 1 


4.34 x 10" 


4 


5.60 x 10" 


4 


4.30 x 10" 


4 


5.50 x 10" 


4 


h 


0.014 




0.041 




0.015 




0.042 




T 


2.32 x 10" 


3 


2.33 x 10" 


3 


2.34 x 10" 


3 


2.43 x 10" 


3 


log 10'% 


7.00 x 10" 


3 


8.26 x 10" 


3 


6.83 x 10" 


3 


8.17 x 10" 


3 


n„ 


1.63 x 10" 


3 


1.86 x 10" 


3 


1.62 x 10" 


3 


1.86 x 10" 


3 


I> v (eV) 


< 0.136 




< 0.171 




< 0.098 




< 0.151 




w 


< -0.93 




< -0.87 




< -0.93 




< -0.87 





5 RESULTS 

5.1 Neutrino mass forecasts 

We begin by considering CMB data alone. The results of our 
MCMC runs are displayed in Table [4] along with Fisher matrix 
results for comparison. The forecast 68% upper limit on the total 
neutrino mass with CMB T+P+WL with C0r£-like noise levels 
is 0. 136 eV for the inverted hierarchy, and 0.098 eV for the normal 
hierarchy from the MCMC analysis. Recall that in the fiducial mod- 
els, Y. m v — 0. 105eV in the inverted hierarchy and Y. m v — 0.062eV 
in the normal hierarchy. For Gaussian marginalised posterior distri- 
butions, truncated by the prior on the minimum total mass, these 
upper limits correspond to lo" errors of 0.036 and 0.039eV for 
the inverted and normal hierarchies, respectively. These results 
are consistent with the MCMC forecasts in the COrE White Pa- 
per dThe COrE Collaboration et al.ll201 ll) . They derived 95% up- 
per limits on the lightest neutrino mass of the two hierarchies of 
0.045 eV and 0.034eV for the inverted and normal hierarchies re- 
spectively. Our corresponding values are 0.044eV (inverted) and 
0.039 eV (normal), the slight differences probably being due to the 
slightly different fiducial parameters used. The lo" errors on the 
other parameters are also consistent, with small differences due to 
the different parameter sets used. The marginalised 68 and 95% 
confidence regions for the massive neutrino energy density and the 
other parameters are plotted in Figs|5]and|6] along with Fisher ma- 
trix results. 

Comparing our MCMC results with those from the Fisher ma- 
trix, we note significant discrepancies. The Fisher matrix overesti- 
mates the forecast lo" errors on £m v and w by a factor of two and 
on h by a factor of three, but agrees well with the MCMC results 
for the other parameters. A likely cause of this discrepancy is the 
strong angular-diameter degeneracy between h, w, and Y* m v in the 
unlensed CMB power spectra. This degeneracy is not completely 
broken by the inclusion of the lensing reconstruction and may in- 
troduce some non-Gaussianity into the likelihood, thus violating 
the Fisher approximation. 

An additional source of the discrepancy concerns our chosen 
parameter set. As discussed in Appendix lAl using the (/i,£mv,(f) 
parametrization requires fixing the ratio of certain power spectra 
derivatives to eliminate numerical noise at intermediate and high 
multipoles. The results of the Fisher analysis are sensitive to this 
ratio, which involves derivatives. If instead we replace h 

with 8(z t ) as a parameter in the Fisher analysis, in which case we 
enforce dC[ /dw = at intermediate and high multipoles for the un- 
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Figure 5. Marginalised confidence regions (68 and 95 per cent) between 
the massive neutrino energy density and the rest of the parameter set for 
the inverted hierarchy. Shaded regions are MCMC results, and contours are 
from the Fisher matrix. 



lensed CMB spectra, on transforming back to h we do not recover 
accurately the original Fisher matrix. Sensitivity to such choices is 
clearly unsatisfactory, and is the likely explanation for the differ- 
ences in forecast errors in these parameters. 

A simil ar discrepancy betw een MCMC and Fisher forecasts 
was found in lPerotto et alj |2006) in the context of forecasting for 
Planck-like noise, although their disparity is much smaller than 
ours reported here. In that work, the discrepancies reduced when 
lensing reconstruction was included. It was argued that, since lens- 
ing breaks the main degeneracies in the unlensed CMB, including it 
brings the posterior closer to a multivariate Gaussian and improves 
the agreement between Fisher and MCMC analyses. The same is 
almost certainly true in our forecasts, but is possible that the sig- 
nificantly lower noise levels we have used make the Fisher results 
more vulnerable to numerical effects even with lens reconstruction 
included. Note that we checked that r unning our Fisher a nalysis 
with the Planck-like noise levels used in |Perottoetal.ld2006t) repro- 
duces their neutrino mass fo recasts. It is intere sting that the Fisher 
forecasts on neutrino mass in lKaplinghat et al.l d2003l) from a rather 
more sensitive, higher-resolution CMB satellite are comparable to 
our MCMC results. In particular, they find a 1(7 error of 0.044eV 
assuming two massless and one massive neutrinos. 

With CMB T+P+WL, the constraints on neutrino masses are 
limited by degeneracies with other parameters. To illustrate this, 
we note that the conditional errors on the total neutrino mass using 
G(z*) (rather than h) in a Fisher analysis are 0.0242 and 0.0192eV 
for the normal and inverted hierarchies, respectively. In this param- 
eter set, the conditional information on neutrino masses is domi- 
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Figure 6. As Fig.|5]but for the normal hierarchy. 



nated by Cf . The most relevant degeneracy for neutrino masses is 
with the cold dark matter density, as can be seen in Figs|5]and|6] We 
may understand this as an effect of the lensing potential, since no 
such degeneracy is observed with just the unlensed CMB. Inspec- 
tion of the derivatives of Cf d with respect to YJm v and Q. c h 2 reveals 
broadly similar features. Increasing neutrino mass damps the lens- 
ing potential on small scales, with large scales unaffected, as dis- 
cussed in Section|2] Increasing the cold dark matter density boosts 
the lensing potential on small scales, leaving large scales relatively 
unaffected. To see this, note that increasing Q. c h 2 pushes back the 
epoch of matter-radiation equality to earlier times. For modes that 
are sub-Hubble during radiation domination, the gravitational po- 
tential undergoes oscillations with decaying amplitude until settling 
down to a constant value again well into matter domination. With 
matter-radiation equality earlier, the potential decays less during 
the shorter sub-Hubble radiation-dominated phase and the potential 
on scales smaller than horizon scale at matter-radiation equality is 
increased. On large scales there is no such effect. The change in the 
power spectrum of the gravitational potential appears to dominate 
other changes, such as the mapping from I to k due to the reduced 
distance to typical lenses on increasing Cl c h 2 , in determining the 



effect on Cf . Since the effect of increases in neutrino mass and 
cold dark matter density have opposite sign in the lensing power 
spectrum, the parameters are positively correlated. 

Finally we note that there is little difference in our MCMC 
results between the hierarchies. This is in contrast to the Fisher ma- 
trix results, which give a 35% larger limit on the total mass relative 
to the fiducial value in the normal hierarchy compared to the in- 
verted. However, since we believe the Fisher results are unreliable, 
it is clear that this difference is not significant. 
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Table 5. Upper limits (68%) on the total neutrino mass Y, m v and dark en- 
ergy equation-of-state parameter w for inverted (top) and normal (bottom) 
hierarchies, combining different external datasets (see text for details). 





No priors 


Ho 


WFIRST 


BOSS 


Combined 


Z>v(eV) 


0.136 


0.131 


0.131 


0.119 


0.118 


w 


-0.93 


-0.97 


-0.98 


-0.98 


-0.99 


S> v (eV) 


0.098 


0.095 


0.095 


0.082 


0.080 


w 


-0.93 


-0.97 


-0.98 


-0.98 


-0.99 
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Figure 7. Marginalised confidence regions (68 and 95 per cent) between 
Q. v h 2 , £l c h 2 , and w, with (green) and without (blue) a 2% prior on Hq, for 
inverted (top) and normal (bottom) hierarchies. 

5.2 Inclusion of external datasets 

We now consider to what extent external data can improve the con- 
straints on massive neutrinos by breaking degeneracies in the CMB 
T+P+WL analysis. 

We start with the effect of including a prior of width 2% on 
the Hubble constant. The error on the total neutrino mass (and w) 
is reported in Table [5] The Hubble prior changes this little since 
lensing has broken most of the degeneracy between neutrino mass 
and h that is present in the unlensed CMB leaving a marginalised 
error on h from CMP T+P+WL alone comparable to the width of 
the Hq prior (see Table |4j- The most important degeneracy for neu- 
trino mass is with the cold dark matter, as seen in Figs [5] and [6] 
The inclusion of an Hq prior has little effect on this degeneracy; see 
Fig-El However, the constraint on w is reduced by over a factor of 
2. This is due to the breaking of the degeneracy between h and w, 
a consequence of the geometric degeneracy in the unlensed CMB 
which is not completely removed by the lensing reconstruction. 

When we include forecast observations of Type 1A super- 
novae from WFIRST, we see similar improvements in Y, m v and 
w as when including the Hq prior; see Table|5] After marginalising 
over absolute magnitude, the supemovae data are essentially dis- 
tance ratios and so, out to z = 1, are mostly sensitive in flat models 
to Cl m and w. The density parameter is well constrained by super- 
novae which, when combining with the CMB, sharpens up con- 
straints on h despite the supernovae distance ratios providing no 
direct measure of this parameter. Moreover, w is well constrained 
since distances at low redshift are sensitive to the evolution of dark 
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Figure 8. As Fig. [7] but with the Hq prior replaced by forecast Type 1A 
supernovae data from WFIRST. 



energy. The MCMC results show that the total neutrino mass has 
little degeneracy with the geometric parameters h and w, so the er- 
rors on Y, m v do not change by much with the inclusion of WFIRST 
data. The degeneracy with Q, c h 2 is again preserved, as shown in 
Fig.i 

A greater improvement is seen when forecast BAO data from 
the complete BOSS is included as a prior. The 68% upper limits 
on the summed mass shrink to 0.119eV and 0.082eV for the in- 
verted and normal hierarchies, respectively, and the upper limit on 
w becomes —0.98 for both hierarchies (Table [5}- The outperfor- 
mance of BAO data compared to Type la supernovae is probably a 
combination of effects. Firstly, the supemovae provide only relative 
distance information (i.e. ratios) at low redshifts, which in combi- 
nation with a well-constrained distance measurement to recombi- 
nation from the CMB does not provide as strong a constraint on the 
evolution of the expansion rate as an absolute distance measure- 
ment from BAO. This effect is partly compensated by the fact that 
our BAO measurements are at higher redshifts, closer to the last- 
scattering surface, and so do not provide as long a lever arm as the 
supernoave. Secondly, the BAO measurements probe both the an- 
gular diameter distance and the Hubble rate, thus offering an inter- 
nal consistency check that supernoave do not possess. Finally, our 
forecasted supernovae measurements contain a limiting systematic 
floor to the error budget which is not shared by the BAO. The effect 
of BOSS information is shown in Fig. [9] We see from this plot that 
the degeneracy with Q. c h 2 still remains, although with its strength 
reduced. 

Finally, the combination of all priors gives excellent preci- 
sion on both £m v and w, the 68% upper limits being 0.118eV and 
—0.99 for the inverted hierarchy, and 0.080eV and —0.99 for the 
normal hierarchy; see Tableland also Fig. [l0]for confidence re- 
gions. The corresponding ltr errors assuming truncated Gausssian 
posterior distributions are 0.018 and 0.021 eV. The 95% upper limit 
in the normal hierarchy is 0.103eV (which agrees well with Guas- 
sian extrapolation from the 68% limit). The implication of this is 
that, even if cosmology had no sensitivity to mass splittings but 
only to the total mass, the inverted hierarchy would typically be 
disfavoured at almost the 2a level if neutrinos were indeed in the 
minimal-mass normal hierarchy. 
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Figure 9. As Fig. [7] but with the Hq prior replaced by forecast BAO data 
from BOSS. 
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Figure 10. As Fig.[7]but with all external data included (Ho, Type 1A su- 
pernovae, and BAO). 

5.3 Behaviour with fiducial mass 

We have repeated our MCMC analysis for different fiducial masses. 
In Table [6] we report the 1 a errors on the total neutrino mass for 
various fiducial values. For the fiducial masses less than 0.373 eV, 
the lrj error is computed from the 68% upper limits, assuming the 
marginalised posterior distribution on Y, m v can be approximated 
by a Gaussian truncated by the lower limit on the total mass in the 

Table 6. lo~ error (see text for details) on the total neutrino mass as a func- 
tion of fiducial mass, from MCMC runs including only CMB T+P+WL. 



Fiducial Mass (eV) 


Hierarchy 


cr(£m v ) (eV) 


0.062 


Normal 


0.039 


0.105 


Inverted 


0.036 


0.140 


Inverted 


0.024 


0.373 


Degenerate 


0.037 


0.559 


Degenerate 


0.037 



respective hierarchy. For total masses greater than 0.3 eV, the pres- 
ence of mass splitting becomes unimportant, so we forecast assum- 
ing three degenerate neutrinos, which speeds up the calculations. In 
these cases, we simply quote the standard deviation of the samples. 
The forecasts are for CMB T+P+WL alone. 

We observe only a mild variation of the error with fiducial 
mass, which we interpret as a balance between two competing 
effects. As the mass increases, the distinctive effect of neutrino 
free-streaming on the lensing power spectrum is pushed to smaller 
scales where the reconstruction noise is higher. However, we spec- 
ulate that the error from the unlensed CMB alone on the total mass 
should decrease until the neutrinos are sufficiently massive to be 
non-relativistic as recombination, at which point it should flatten 
ou0 



5.4 Hierarchy biasing 

Given our uncertainty in the hierarchy, how should we estimate 
masses from future data? The Baye sian approach to th is problem 
is that of Bayesian model averaging ( lHoetingetalJl999h . in which 
the posterior probabilities for the parameters in each model are 
weighted according to the posterior probability of each model. This 
correctly propagates model uncertainty into parameter errors. In the 
limit that the evidence for one model is overwhelming, Bayesian 
model averaging reduces to parameter estimation from that model. 
In the case that the posterior probabilities for the parameters are 
similar in each model, averaging is equivalent to using either model 
alone. 

It is therefore interesting to see how the posterior probabilities 
of the parameters differ between the two hierarchies. To this end, 
we forecast posteriors adopting the normal hierarchy for fiducial 
data taken in the inverted hierarchy (with £m v = 0.105eV. Note 
that this fiducial total mass can realised in both hierarchies. For 
CMB T+P+WL, we find no significant bias in any of the cosmo- 
logical parameters as a result of this procedure. The largest bias is 
in 9(z*), its maximum likelihood value being shifted from its 'true' 
value by 0.76rr. When all priors from external data are included, the 
greatest bias is in w which is shifted by 0.68c 

Our findings about bias are in contrast to the results of 
iDe Bernardis et aO d2009t) . who found shifts in parameters compa- 
rable to their 1 rj errors, with a significant shift in the value of £m v • 
Direct comparisons are difficult since they use a galaxy weak lens- 
ing survey instead of a CMB lensing reconstruction. However, the 
main source of discrepancy is likely due t o the parametrization of 
the hierarchy in lDe Bernardis et al.l d2009l) . which was found to be 
strongly degenerate with £m v . 

5.5 Distinguishing the hierarchies 

Distinguishing the hierarchies is properly a question of model se- 
lection. We discuss this for the two cases of fiducial models that 
are the mimimal-mass normal hierarchy and the minimal-mass in- 
verted hierarchy. 

If the masses are in the normal hierarchy, sensitivity to mass 
splittings is not required to rule out the inverted hierarchy if the true 



5 For such large masses, the effect on the CMB spectra is roughly linear in 
the total mass. However, for light masses, their effect on scales / > 500 for 
fixed 9(zt) is roughly quadratic in the mass (see Section [5) which would 
give a constant error in the square of the mass. 
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Table 7. Values of In (B) for the inverted to normal hierarchies when the 
fiducial model is the inverted hierarchy (with Y. m v — 0.105eV). The values 
are for T+P+WL plus the indicated external data. 



No priors Hq 


WFIRST 


BOSS 


Combined 


-1.08 2.66 


2.53 


2.53 


2.51 



total mass is low enough and the observations have sufficient sen- 
sitivity to the total mass. With no sensitivity to mass splittings, the 
likelihoods for the two hierarchies would necessarily be equal at 
the same total mass (and all other parameters) but the lower bound 
on total mass in the inverted hierarchy would exclude the region 
of high likelihood giving significant odds in favour of the (correct) 
normal hierarchy. Of course, sensitivity to mass splittings would 
reduce the likelihood in the inverted hierarchy compared to the nor- 
mal, further favouring the latter. 

We do not consider the minimal-mass normal hierarchy in de- 
tail here since our Gaussian-fitting to the likelihood is likely to be 
very noisy when the prior range excludes the peak of the likeli- 
hood. However, we note that lower limits on the Bayes' factor can 
be inferred from the results of Section [5]by assuming no sensitiv- 
ity to mass splittings. The forecast constraint on total mass using 
all external data then gives an odds ratio of 15:1 in favour of the 
normal hierarchy in the most favourable case of the minimum to- 
tal mass (0.056 eV). In our forecasting, the peak of the likelihood 
when analysing with the correct hierarchy is necessarily at the true 
parameter values. Under one-sigma fluctuations of the peak down- 
wards and upwards, due to cosmic variance and measurement error, 
the odds ratios vary between 70:1 and 4:1. 

For the case of the inverted hierarchy, we compute typical 
Bayes' factors with the approximations discussed in Section [3731 
Our results for ln{6) are shown in Table|7] where the Bayes' fac- 
tor B is the ratio of evidence for the inverted to normal hierarchies. 
With no external data, the Bayes factor is negative, indicating that 
the normal hierarchy would be preferred by the data even though 
the actual model is inverted. As noted in Section [3731 this arises 
because the Occam factor favours the normal hierarchy when the 
likelihoods have only weak sensitivity to mass splittings and the 
fiducial total mass can be realised in both hierarchies. We have ver- 
ified that the other factor in B, the ratio of maximum likelihood 
values, is sub-dominant compared to the Occam factor. 

The addition of geometric priors to the CMB data pushes 
the Bayes' factors into the positive 'strong' evidence regime with 
odds ratios around 12:1 in favour of the (correct) inverted hierar- 
chy. Again, the Occam factor dominates the likelihood ratio (which 
equals e 023 ) but now favours the inverted hierarchy, due to small 
differences in the shapes and peak positions of the likelihoods for 
the two hierarchies. We note that our results for the Bayes' fac- 
tors wifh_external geom etric data are comparable to those reported 
by Ijimenez et alj d2010l) for a full-sky, cosmic-variance-limited 
galaxy weak lensing measurement with median redshift 3, although 
our treatment of the mass hierarchies is rather different to theirs. 

Note that to calculate our Bayes' factors, we have to calculate 
the value of the maximum likelihood when an inverted model is 
analysed as if it were normal. Since we simply use the maximum- 
likelihood included in the MCMC chains, there is some error intro- 
duced. Since the true value cannot exceed unity, and we typically 
find minimum y} values around 0.4, the logarithm of the Bayes' 
factors could be lower by up to 0.2. One consequence of this is 



that there is little significance in the anomalously low Bayes' factor 
found in the 'all priors' case. 



6 CONCLUSIONS 

We have forecast constraints on cosmological parameters including 
light neutrino masses with future CMB temperature, polarization 
and weak-lensing-reconstruction information from a post-Planck 
space-based experiment such as COrE. Constraints on mass split- 
tings from oscillation data were used as a prior to delimit two possi- 
ble mass hierarchies, normal and inverted. We performed MCMC- 
based forecasts for fiducial models with masses close to the min- 
imal masses in the normal and inverted hierarchies (specifically, 
we took Y. m v = 0.062eV for normal and £m v = 0. 105eV for in- 
verted). Our results show that the COrE mission alone could place 
68% upper bounds on the total neutrino mass of 0.098 and 0. 136eV 
for the normal and inverted hierarchies. For Gaussian marginalised 
posterior distributions truncated by the prior on the minimal mass 
in each hierarchy, these would correspond to 1 o~ errors of 0.039 and 
0.036eV. 

We found good agreement between our MCMC results and a 
Fisher matrix forecast for all parameters except the neutrino mass, 
equation-of-state of dark energy w and the Hubble parameter h, 
for which the Fisher matrix overestimates the errors by at least a 
factor of two. These parameters are highly degenerate in the un- 
lensed CMB spectra for very light masses with only the combina- 
tion that enters the angular scale of the sound horizon at recom- 
bination, 6(z*), well constrained. Any independent information in 
the unlensed CMB spectra on intermediate and small scales con- 
strains combinations of squared-masses as the minimal-masses are 
approached, these being the leading corrections to the neutrino ve- 
locity, energy density and pressure in the ultra-relativistic limit. 
This dependence on mass further violates the Gaussianity assump- 
tion made in Fisher forecasts when the total mass is used as a pa- 
rameter with ultra-light neutrinos. The £m v -w-/z degeneracy is not 
fully broken by the lensing reconstruction. As a result, our Fisher 
results proved rather unstable to changes in parametrisation and 
details of the method of calculation of power spectra derivatives. 
We therefore advise caution when using Fisher matrices to forecast 
constraints with ultra-light neutrino masses such as those consid- 
ered in this work. 

We have also considered the addition of forecast geometric in- 
formation from BAO from the complete SDSS-III BOSS, and Type 
la supernovae distance moduli from WFIRST, as well as a future 
2% constraint on the Hubble constant. These are relatively 'clean' 
probes, free from the problems of biasing and non-linearity inher- 
ent in using galaxy clustering information directly, and from instru- 
mental effects such as from variations in the point-spread function 
in cosmic shear. The distance information offered by these probes 
brings down the 68% upper limit on the total neutrino mass to 0. 1 18 
and 0.08 eV for the inverted and normal hierarchies (corresponding 
to Gaussian lo~ errors of 0.018 and 0.021 eV). We also find percent- 
level precision on the dark energy equation of state parameter w. 
We have found that the BAO data is the most effective geometric 
probe when combined with CMB T+P+WL. This is likely due to 
a combination of effects. Compared to supernovae, the BAO mea- 
surements provide absolute distance measurements and measures 
of the Hubble rate. BAO are at higher redshift, which provides less 
of a lever arm with the angular-diameter distance to last-scattering 
from the CMB than supernoave, but with the low redshift end not 
extending so far into dark-energy domination (which confuses neu- 
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trino mass determinations from CMB -calibrated relative distances). 
Moreover, our supernovae forecasts include a limiting systematic 
floor to the errors which is not the case for BAO. 

The precision achieved on the total neutrino mass when all our 
datasets are combined would be sufficient to disfavour the inverted 
hierarchy at typically greater than the 2o~ level if neutrinos were in 
the minimal-mass normal hierarchy. For the alternative case of the 
inverted hierarchy, we approximated the Bayes' factor with an ex- 
tension of the 'Laplace' method to deal with the prior boundaries. 
With CMB T+P+WL alone, and a fiducial model with total mass 
0.105eV close to the minimum value of 0.095 eV, the Occam fac- 
tor (which gives weight to models whose parameters do not need 
to be so tightly constrained to fit the data) leads to weak favouring 
of the (wrong) normal hierarchy over the inverted with 3:1 odds. 
This situation is overturned by including the geometric datasets, in 
which case we typically find odds ratios of 12: 1 ('strong' evidence) 
correctly in favour of the inverted hierarchy. The best prospect for 
distinguishing the hierarchies is for these minimal-mass configu- 
rations - mass splittings and their orderings become irrelevant for 
cosmological observables at higher masses. 

We also ran MCMC analyses to calculate potential biases on 
parameters by analysing data assuming the wrong hierarchy. We 
found no biases greater than 1 o~ when analysing an inverted model 
while a ssuming it to be a normal m odel. This is in contrast to the re- 
sults of De Ber nardis et Z]|2009), although direct comparisons are 
difficult since they forecast for different datasets and parametrize 
the hierarchies differently. We note that a straightforward solution 
to dealing with such potential biases is to perform Bayesian model 
averaging. This would correctly propagate model uncertainty into 
parameter errors. 

Our results show that CMB lensing, combined with priors on 
mass splittings from oscillation data and external geometric data, is 
a promising route to determining whether neutrino masses are hier- 
archical and, if they are, the ordering of the mass eigenstates. CMB 
lensing provides a relatively clean measure of the effect of neu- 
trino masses on the clustering of matter below the free-streaming 
scale. However, even in the most optimistic scenario we consider, 
the evidence for either hierarchy will never be very strong for these 
cosmological probes. We would expect our evidence ratios to in- 
crease with the addition of other measures of broad-band power in 
the matter power spectrum, although at the risk of bringing in a host 
of other systematic effects. One particularly interesting data com- 
bination to consider further is combining CMB lensing and cos- 
mic shear tomography. This is free of the issues of galaxy bias, but 
non-linear effects will be important for cosmic shear and motivates 
further studies o f the effects of individual masses on the non-linear 
power spectrum (Wagner et al. 201^). 
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APPENDIX A: PROTECTING DEGENERACIES IN THE 
FISHER MATRIX 

In constructing the Fisher matrix, we found it necessary to protect 
several near-exact degeneracies at intermediate and high /. We did 
this for the well-known T-A s degeneracy by enforcing a propor- 
tionality between the appropriate derivatives. This was also done 
for w-h, where the effect on the unlensed TT, EE, and TE spectra 
is purely through the angular diameter distance at small scales. The 
necessity of taking one-sided derivatives for w (w > —1) can cause 
a small phase shift with respect to the ^-derivative, which can break 
degeneracies: a simple one-sided derivative at a point actually gives 
the second-order-accu rate derivative about th e mid-point. A similar 
problem was noted in Eise nstein et al.l d 19991) when differentiating 
with respect to curvature. This makes determining the proportion- 
ality constant between the h and w derivatives of the power spectra 
at high / from the ratio at some intermediate multipole problem- 
atic. Even with a one-sided derivative accurate to second order (us- 
ing appropriate finite-difference coefficients) this problem did not 
disappear. Instead, we chose to enforce the degeneracy by com- 
puting the derivatives of dU(z*) with respect to w and h, and us- 
ing their ratio as the proporti onality constant betwe en dC/ /dw and 
dCi/dh (see Equation B3 of Eisenste in et al J 19991) . This was done 
for / >100, with the precise multipole chosen for a smooth transi- 
tion. In addition, this enforcement was made across the TT, EE, 
and TE spectra by using the same proportionality constant in each 
case, since the physics governing the degeneracy is in each case the 
same. Our Fisher results are fairly insensitive to the enforcement 
of the h-w degeneracy, and the choice of multipole (/ ~100) above 
which it is enforced. 

No such degeneracy was enforced for ~Lm v -h or ~Lm v -w, since 
an inspection of the Zm v derivative of the CMB spectra at fixed 
d\ (z*) revealed a non-zero signal at small scales which did not dis- 
appear when more accurate derivatives (4th-order truncation error 
with large step sizes to beat down numerical noise) were imple- 
mented. This signal is the direct contribution of massive neutrinos 
to the pre-recombination physics discussed in Section [2] It is very 
small, but large enough partially to break the geometric degeneracy 
with h and w. 

Double-sided derivatives were used for each parameter except 
w where the phantom (w < — 1) regime is not considered and for 
which we used a second-order-accurate, one-sided derivative. A 
step size of 5% was used in computing the derivatives, which rep- 
resents a co mpromise between numerical noise and Taylor series 
convergence jPress et alj2007l) . We checked the stability of our re- 
sults to different step sizes, and found variation of at most 10% in 
our marginalised errors with the high accuracy settings used to run 
CAMB. 



APPENDIX B: COVARIANCE MATRIX ESTIMATION 

To compare the statistical quantities inferred from MCMC chains to 
those of a Fisher analysis, it is useful to have some idea of the mag- 
nitude of the statistical fluctuations in the former. For a chain with a 
large number of uncorrelated samples, we would expect these fluc- 
tuations to be small, but it is important to test whether the scatter is 
small enough to resolve any sharp degeneracies present in the like- 
lihood. In this section, we investigate the effect of statistical fluc- 
tuations on the eigenstructure of the empirical covariance matrix 
calculated from the chains. 

We explore this issue by generating random realisations of a 
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covariance matrix, with the mean matrix given by the output of a 
Fisher analysis. We use the parameters in Table[TJ but now addition- 
ally include the primordial helium abundance Y}i e as a parameter 
with fiducial value 0.24. 

We assume that each covariance matrix has been estimated 
from a realisation of N independent parameter samples from a 
Gaussian posterior, {6f}"=\ "q '■ The unbiased estimator for the co- 
variance matrix is then 

1 N 

c u = j7— l L( e ;'-W e j-ej)> CBi) 
«— i 

where 6,- is the empirical mean of the 9" . An application of Wick's 
Theorem then gives the covariance of these estimators 

((Cy -Qj)(C kl -C u )) = -^-j(C ik Cji+C u Cj k ), (B2) 

where Cy is the 'true' covariance matrix of the posterior from 
which the samples are drawn (we take Cy from the output of our 
Fisher analysis). We now assume that the estimator Cy is Gaussian 
distributed, which is true for large N by the central-limit theorem. 
Using our fiducial covariance matrix Cy, we draw random reali- 
sations of Cy from a Gaussian distribution with mean Cy and co- 
variance given by equation dB2t . and study the scatter in its eigen- 
structure about the mean. The results are displayed in Figs IB 1||B2I 
and|B3]for 1000 realisations, with N = 15000. A typical Markov 
chain might be around 30000 samples long, but since these are cor- 
related, we have implicitly thinned the chain by a factor of two. 

Our results show that, in this application, the highest and low- 
est eigenvalues of the covariance matrix may be determined with 
percent-level accuracy with 15000 samples. In Fig. IB3I we show 
the distribution of the neutrino mass component of the most poorly 
constrained eigenvector. In our fiducial model, this eigenvector is 
responsible for most of the neutrino mass variance. We see that the 
degeneracy direction most relevant for neutrino mass is constrained 
at the percent level. 

We conclude from this exercise that MCMC covariance ma- 
trix estimation is sufficiently robust to the effects of finite sample 
size for our purposes. It remains to be seen how more degenerate 
fiducial models may be under-sampled by Monte Carlo techniques. 
This might be relevant to models with a redshift-dependent dark en- 
ergy equation-of-state, where we might expect strong degeneracies 
in the posterior. 




Fractional Variation 



Figure Bl. Histogram of fractional variations in the lowest eigenvalues 
of the empirical covariance matrices in an ensemble of 1000 matrices. 
Each empirical covariance matrix has statistics appropriate to it being esti- 
mated from 15000 independent samples from a Gaussian posterior in nine- 
dimensional parameter space with covariance given from a Fisher analysis. 
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Fractional Variation 



Figure B2. As Fig. lBll but for the highest eigenvalue. 
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Fractional Variation 

Figure B3. As Fig. IB II but for the neutrino mass component of the most 
poorly constrained eigenvector. This component is the dominant source of 
the marginalised neutrino mass error in this case. 



